Search results for "Simple random sample"
showing 10 items of 10 documents
Optimal selection of individuals for repeated covariate measurements in follow-up studies
2016
Repeated covariate measurements bring important information on the time-varying risk factors in long epidemiological follow-up studies. However, due to budget limitations, it may be possible to carry out the repeated measurements only for a subset of the cohort. We study cost-efficient alternatives for the simple random sampling in the selection of the individuals to be remeasured. The proposed selection criteria are based on forms of the D-optimality. The selection methods are compared with the simulation studies and illustrated with the data from the East–West study carried out in Finland from 1959 to 1999. The results indicate that cost savings can be achieved if the selection is focuse…
A Procedure for Selecting Representative Subsamples of a Population from a Simple Random Sample
2015
This paper proposes a procedure for selecting large subsamples drawn from a large simple random sample that are more representative of the population under study. By means of the so-called constant of proportionality, the procedure seeks to maximize the size of the subsample taken from a stratified random sample with proportional allocation, restricting it to a p-value high enough to achieve a good fit using Pearson’s chi-square goodness of fit test. The user has the freedom to choose between a larger subsample with poorer adjustment or a smaller subsample with a better fit. We use the Continuous Sample of Working Lives (CSWL), a set of micro data taken from Spanish Social Security records,…
Los emprendedores surgidos de las empresas multinacionales de inversión extranjera directa: un estudio exploratorio en Costa Rica
2014
ResumenEl presente trabajo busca evaluar la creación de empresas por parte de exempleados de empresas multinacionales de inversión extranjera directa. En concreto, se busca dimensionar el fenómeno, caracterizarlo, así como valorar el desempeño de las empresas creadas. El estudio se hizo mediante un muestreo aleatorio simple con margen de error del 7% y nivel de confianza del 95%, sobre una base de datos de 11.120 exempleados de empresas multinacionales en Costa Rica (n=175). Además se utilizó un grupo control ad hoc. Los resultados muestran cómo son estos emprendedores, el proceso creador experimentado, las características y el desempeño de las nuevas empresas.AbstractThe aim of this invest…
Improving the Representativeness of a Simple Random Sample: An Optimization Model and Its Application to the Continuous Sample of Working Lives
2020
This paper proposes an optimization model for selecting a larger subsample that improves the representativeness of a simple random sample previously obtained from a population larger than the population of interest. The problem formulation involves convex mixed-integer nonlinear programming (convex MINLP) and is, therefore, NP-hard. However, the solution is found by maximizing the size of the subsample taken from a stratified random sample with proportional allocation and restricting it to a p-value large enough to achieve a good fit to the population of interest using Pearson&rsquo
Dynamic Phase Diagram of the REM
2019
International audience; By studying the two-time overlap correlation function, we give a comprehensive analysis of the phase diagram of the Random Hopping Dynamics of the Random Energy Model (REM) on time-scales that are exponential in the volume. These results are derived from the convergence properties of the clock process associated to the dynamics and fine properties of the simple random walk in the $n$-dimensional discrete cube.
Selection of Large Sub-Samples from the Continuous Sample of Working Lives Representative of the Benefits Provided by the Spanish Public Pension Syst…
2016
The Continuous Sample of Working Lives (CSWL) is a set of anonymized microdata with information about individuals taken from Spanish Social Security records. It provides very valuable information, which is used in many studies on labor economics and in the analysis of the Spanish public pension system. This article presents two major contributions: The first is an analysis of how representative CSWL is of the population of pensioners for the period 2005-2013. It is concluded that the CSWL does not follow the same distribution as the population with respect to some types of benefits, and that this happens in most waves. One of the reasons is that it is obtained by simple random sampling, so …
Horvitz-Thompson estimators for functional data: asymptotic confidence bands and optimal allocation for stratified sampling
2009
When dealing with very large datasets of functional data, survey sampling approaches are useful in order to obtain estimators of simple functional quantities, without being obliged to store all the data. We propose here a Horvitz--Thompson estimator of the mean trajectory. In the context of a superpopulation framework, we prove under mild regularity conditions that we obtain uniformly consistent estimators of the mean function and of its variance function. With additional assumptions on the sampling design we state a functional Central Limit Theorem and deduce asymptotic confidence bands. Stratified sampling is studied in detail, and we also obtain a functional version of the usual optimal …
A Bayesian comparison of cluster, strata, and random samples
1999
When sampling from finite populations, simple random sampling (SRS) is rarely used in practice, due to either high cost or information to be gained from more efficient designs. Bayesian hierarchical models are a natural framework to model the non-randomness in the sample. This paper concentrates on the effects that the design has on inference about characteristics of the finite population, and makes a critical comparison among some common designs.
Using Complex Surveys to Estimate theL1-Median of a Functional Variable: Application to Electricity Load Curves
2012
Mean proles are widely used as indicators of the electricity consumption habits of customers. Currently, Electricit e De France (EDF), estimates class load proles by using point-wise mean function. Unfortunately, it is well known that the mean is highly sensitive to the presence of outliers, such as one or more consumers with unusually high-levels of consumption. In this paper, we propose an alternative to the mean prole: the L1-median prole which is more robust. When dealing with large datasets of functional data (load curves for example), survey sampling approaches are useful for estimating the median prole and avoid storing all of the data. We propose here estimators of the median trajec…
The Tax Justice Network-Africa v Cabinet Secretary for National Treasury & 2 Others: A Big Win for Tax Justice Activism?
2019
This paper develops an optimization model for selecting a large subsample that improves the representativeness of a simple random sample previously obtained from a population larger than the population of interest. The problem formulation involves convex mixed-integer nonlinear programming (convex MINLP) and is therefore NP-hard. However, the solution is found by maximizing the “constant of proportionality” – in other words, maximizing the size of the subsample taken from a stratified random sample with proportional allocation – and restricting it to a p-value high enough to achieve a good fit to the population of interest using Pearson’s chi-square goodness-of-fit test. The beauty of the m…